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Abstract. An estimate is derived for the volume fraction of a subset C"^ = {U : \\ grad J(L'')|| < 
e} C U(A'') in the neighborhood of the critical set C'' ~ U(n)yU(m) of the kinematic quantum 
ensemble control landscape J{U) = Tv{U pU^ O), where U represents the unitary time evolution 
operator, p is the initial density matrix of the ensemble, and O is an observable operator. This 
estimate is based on the Hilbert-Schmidt geometry for the unitary group and a first-order ap- 
proximation of II grad J()7)|p. An upper bound on these near-critical volumes is conjectured and 
supported by numerical simulation, leading to an asymptotic analysis as the dimension N of the 
quantum system rises in which the volume fractions of these "near-critical" sets decrease to zero as 
A'^ increases. This result helps explain the apparent lack of influence exerted by the many saddles 
of J over the gradient flow. 



1. Introduction 

Control landscapes are proving to be valuable for providing insights into quantum optimal control 
theory [1]. A simplification can be achieved by observing that the dynamical landscape J : K — >• M 
(i.e. the objective functional) - a map taking a control function as input and producing the value 
of the observable at some final time T - can be written as a composition of a kinematic landscape 
J and a controli— s-propagator map Vt- Here the kinematic landscape J : U(A^) — t- M is a smooth 
real-valued function on the unitary group, and : IK — t- U(A^) is defined implicitly through the 
Schrodinger equation and returns the unitary time evolution operator at the final time T for each 
given input control function. The goal of quantum optimal control is generally to maximize the 
dynamical landscape J = J oVt- 

The latter basic landscape decomposition has been applied to various quantum control objectives, 
including state-to-state transitions [2-4], general quantum mechanical observables on an ensemble 
[5-7], and unitary transformation (quantum gate) preparation [ - ]. This area of research has 
focussed on identifying and characterizing the critical points of the landscapes, revealing important 
features of the associated gradient fiows. In particular, this work has shown that the kinematic 
landscapes have no suboptimal extrema that can act as traps preventing the gradient fiow from 
reaching a global optimum. 
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The present paper considers one aspect of the optimal control of a quantum mechanical observable 
on a A^-level system. Such observables can be defined for an ensemble of initial states (whose 
collective state is described by a density matrix p) by the kinematic landscape J{U) = Tr(C/ pU^O), 
where the observable is represented by a Hermitian matrix O. It is known that, in addition to a 
single global minimum submanifold and a single maximum submanifold, this kinematic landscape 
generally contains a large number of saddle submanifolds [/]. The number of saddles depends on 
the eigenstructure of p and O and generally increases with the size A'^ of the quantum system. By 
continuity, each saddle submanifold is surrounded by a neighborhood in which the norm of the 
gradient is small. Any trajectory of the gradient flow entering such a neighborhood will be slowed 
by this small gradient and reach the global maximum less efficiently than trajectories that never 
enter such neighborhoods. However, since the gradient flow is not attracted to saddles as it is to 
local maxima, the magnitude of the impact of these saddles on the overall gradient flow of this 
landscape is unclear. 

With this perspective in mind, we derive estimates and bounds for the effective volume fractions 
of the critical submanifolds relative to the volume of the unitary group. The term "effective volume 
fraction" is used here to mean the volume fraction of a region around a given critical submanifold 
in which the norm of the gradient is small, i.e. less than some e > 0. These volume fractions will 
serve as an indicator of the likelihood of a gradient integral curve falling under the influence of the 
saddles. This influence can have a profound impact on the efficiency of a gradient ascent algorithm 
(whether implemented in the laboratory or in numerical simulations), or indeed any optimal control 
algorithm based on local information. 

Additionally, we consider the asymptotic behavior of these volume fractions as the system size 
N is allowed to increase. By embedding the original p and O in these larger spaces, this analysis 
addresses the role of truncation to finite N in defining the characteristics of the gradient flow. In 
particular, this research will show that the volume fractions tend to zero as — t- oo, implying that 
the saddles should have a vanishingly small impact on the gradient flow. This conclusion helps to 
explain the observed behavior of numerical quantum optimal control simulations, which show no 
evident increase in search effort as N rises. 

The paper is organized as follows. Section 2 reviews the structure of the critical submanifolds of 
J{U) = Tv{UpWO). The induced Hilbert-Schmidt measures of these submanifolds are computed 
in Section 3. The near-critical sets are described in Section 4, yielding estimates of the volumes of 
these sets in terms of a Haar measure. Upper bounds for these volumes are developed in Section 
5 and the asymptotic behavior as — )■ oo is considered. The results are summarized Section 6. 
Four appendices are included that detail the Chevalley lattice of direct products of Lie groups, the 
Hessian operator of J, the second fundamental forms of the critical submanifolds of J, and provide 
supporting arguments for a key conjecture. 
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2. The Critical Submanifolds of the Landscape 

The landscape J{U) = Tt{UpU^O) is defined on the unitary group U(A^), where the density 
matrix p and the observable O are both N x N Hermitian matrices. Without loss of generality, 
we will make the simplifying assumption that p and O are both diagonal, with monotonically 
descending diagonal elements. This may be done since passage from the landscape based on p and 
O to the one based on their diagonalized forms represents merely a translation over the unitary 
group. Since the metrics considered in this paper are bi-invariant, this translation may be neglected. 

It will be fruitful at certain points in the analysis that follows to consider U(A^) (with the 
geometry induced by the real Hilbert-Schmidt inner product on C^^^) as a Riemannian glob- 
ally symmetric space. This will be done by fixing a permutation matrix CP G and letting 
g ■= U(iV) (S> \J{N) and /Cy := {(y,a'tyT) : V G \J{N)}. Then, under the involutive analytic 
automorphism a{V,W) = (J'l^lP"^, IP'^yCP), is a symmetric pair and Q/lCy is a Riemann- 

ian globally symmetric space under any t/-invariant metric. The map Q/tCy — >■ U(A^) given by 
(y,VF)/Cy ^ V?W^ is clearly smooth, as is the section U ^ S ince the projection 

G — ^ G/JCy is a smooth submersion, VPW^ ^ (I, W?W^?) ^ {V, W)ICt is a smooth inverse to the 
map {V,W)ICj> I—)- VyW^ , so this map is a diffeomorphism between Q/lCy and \]{N). The group 
action of Q on U(A^) induced by the diffeomorphism is then given by {V, W) ■ U = VUW\ and the 
real Hilbert-Schmidt metric on U(A^) is ^-invariant, so U(A'^) is a Riemannian globally symmetric 
space under this structure. The projection $y : ^ — )• U(A^) ~ Q /K,y can also be viewed as defining 
y as a fiber bundle with base U(A^) and fiber K.y. 

Now, let H := U(m)eU(n) C G, where U(m) = U(mi)e- • •eU(ms), U(n) = U(ni)e- • •eU(nr), 
and {rrij} and {nj} are the multiplicities of the unique eigenvalues of O and p, respectively. Then 
the critical set of J{U) = TriUpU^^O) has been shown to comprise a disjoint union of submanifolds 
of U(iV), each of the form <^y{n) [ ], where = U(m)a'U(n) is the orbit of the point ? G 

Sn C U(A'') with respect to the induced action of Ti on U(A^). So $j>(^) = Ti/ Stab-H(lP), where 
the stabilizer Stab-^(;P) is the subgroup of Ti given by 

Stab^(y) =nn]C7 = {{V,W) e U(m) e U(n) : ^j>{V, W) = V7W^ = 7} (1) 

and is identified with U(K) ^ U(m) n TU(n)?t in [ ], i.e. Stab(T) = {{CP^C'^) ■ C e U(m) n 
CPU(n)yt}. Here K is the r X s "contingency table" corresponding to m, n, and IP. For a given 
permutation CP, U(m) © U(n) can be expressed as a fiber bundle [ ] as 

U(m) n yU(n)yt U(m)©U(n) 

Orb(?) C U(iV). 

We seek to compute the "volume" (measure) of Orb(CP) as an embedded submanifold of U(A^), 
where U(A^) is given the Riemannian metric induced by the real Hilbert-Schmidt inner product on 
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£^NxN ^ "We will see that this problem reduces to one of computing the volumes of the Lie groups 
U(K) and U(m) U(n) with respect to certain specific geometries. 

Lemma 1. For any permutation matrix T G Sn, there exists a !P G Sn Pi Orb (IP) such that 
U(m) n lPU(n)y"l" = U(K) in terms of block- diagonal structure and U(n) n lP'l'U(m)y = U(K) = 
J'^U(K)y where U(K) has essentially the same block structure as U(K), but the blocks are reordered. 

Proof. For any S G S^v n U(m) and F G ^tv n U(n), ? = STPt e Sn n Orb(?). Then U(m) n 
yU(n)J'1^ = S^^(U(m) n 5'U(n)3'^)S, so that S may be chosen to reorder the rows and columns of 
each diagonal block of U(m) to group the matrix elements of U(m) fl J'U(n)J''l" according to the 
block of U(n) from whence they came, creating sub-blocks within each block of U(m). This is 
exactly the structure desired for U(K). Similarly, U(n) ny^U(m)y = r(U(n) n 5''l'U(m)J') , so that 
r may be likewise chosen to reorder the elements in each diagonal block of U(n) according to the 
originating block of U(m), constructing U(K). □ 

Without loss of generality, the remainder of the paper will use the permutation 7 described in 
the above lemma, so that U(m) n a'U(n)?t = U(K). 



3. The Hilbert-Schmidt Measure of the Critical Submanifolds 

We now turn to the problem of computing the volumes of the critical submanifolds. To that end, 
we will choose geometries for U(K) and U(m) © U(n) such that there is a local isometry between 
U(m) © U(n) and Orb(y) © U(K). Then it will be shown that the volume of Orb(J') is just the 
quotient of the volumes of U(m) © U(n) and U(K) under these specific geometries. 

Let A,B,C denote the following subspaces of u(m) © u(n): 

A = {{X^T^XT) : XGu(K)} (2a) 
B = {{X,-7^X7) : XGu(K)} (2b) 
C = {A®B)^ = {{Y, Z) : Ye u(m)/u(K) and Z e u(n)/u(K)}. (2c) 

It may be readily verified that these spaces are mutually orthogonal in the Hilbert-Schmidt geometry 
and span u(m)©u(n). We extend them by left translation to form mutually orthogonal distributions 
over U(m) © U(n) that span the tangent space at each point. 

Lemma 2. The distributions A, B, and C are right invariant with respect to Stab(5'). 

Proof. First observe that for any (" G U(K), X G u(K) if and only if C^C^ G u(K). Likewise 
Y G u(m)/u(K) if and only if {Y, X)us = for all X G u(K), if and only if {(YC^ C^C'^)hs = for 
all X G u(K), if and only if (Y^^ G u(m)/U(K) and similarly for Z G u(n)/u(K) with respect to (. 
In other words, the subspaces of u(m) © u(n) given by evaluating the distributions A, B, and C at 
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the identity are invariant under the adjoint action of Stab(J'). Then, for any g Stab(CP) 

and {V, W) G U(m) U(n) 

MwyiCytc?) = m W) ■ (C, yk^) ■ (X, -P^XT : X e u(K)} (3a) 

= {{y, w) ■ (X, ytxy) . (c, yt^y : x e u(k)}, (3b) 

which is the right translation of A(y^w) to (F, T^) • {(,7'^'?), so that A is right invariant with 
respect to Stab(J'). Similar arguments apply to B and C. □ 

Now, denote by $g> the restriction of toH = U(m) © U(n), and consider the images of these 
three distributions A, B, and C under d$y. Since for any {V, W) G U(m)©U(n) and any {5V, SW) G 
T(y_ty)U(m) © U(n), there exists {X, Y) G u(m) © u(n) such that {dV, 6W) = {VX, WY), 

d(v,w)^ASV, SW) = d^v,w)^AVX, WY) = VX^W^ - V7YWK (4) 

Then the kernel of di^v,W)^J' is {{VX,WY) G T(y^py)U(m) © U(n) : Y = 9^X9}, which is 
exactly the left-invariant distribution A evaluated at {V, W) . When acting on an element (V, W) ■ 

(X, -ytXT) G B(^Y^w), d(^v^w)^7 yields 2VX'J'W'', and when acting on an element {V, W) ■ {Y, Z) G 
C{v,W)^ d(V,W)^y yields VYVW^ -VVZW^ . Then, since X G u(K) and Y-VZ^^ G u(iV)/u(K), the 
d(y^^)^>g> images of B^y^yy-j and C(v,vK) are orthogonal complements under any bi-invariant metric 
on U(A^), in particular the Hilbert-Schmidt metric. Furthermore, for Xi,X2 G u(K), Yi,Y2 G 
u(m)/u(K), and ^1,^2 G u(n)/u(K), 

(di$a,(Xi, -ytXiy),di6y(X2,-a'tx2a'))Hs = 2{{x,,-'?^Xi9),{X2,-t^X2T))us (sa) 

(didy(yi, Zi), dni3>(y2, Z2))m = ((n, Zi), {Y2, Z2))hS (5b) 

since TZia't,a'Z2Tt G (yu(n)a't)/u(K) C u(iV)/u(m) arc orthogonal to Yi and Y2 in the Hilbert- 
Schmidt metric. So the image through d^y of A is zero, the restriction of d^y to B is two times a 
linear isomctry, and the restriction to C is a linear isometry. 

Since $y is a fiber bundle with base Orb(J') and fiber U(K), for any Uq G Orb(J'), there 
exists a neighborhood Q of Uq and a smooth local section jq ■ Q ^ U(m) © U(n) such that 
^vilQiU)) = U for all U e Q and such that Im (d[/7Q) = B^^(ij) (B C^q(u), the orthogonal 
complement of A^q(u) = ker (^d^^^jj^^y) in T^Q([/^U(m)©U(n) with respect to the Hilbert-Schmidt 
metric. Let -^q -.Q® U(K) ^y^{Q) C U(m) © U(n) be defined by 

^Q{U,0:=lQ{U)-{C,'3'k'?)- (6) 

Then 

d^u,o'^Q{6U,5C) = iq{U) ■ {5C.yHcy) + duiQm ■ {^.9^9) (7) 

where it may be observed that ^q{U) ■ {6C, y^^CJ") G by left invariance and the fact that 

((5C, 9'^ 6(7) G ^(^^yt^y) and also du'tqiSU) ■ ((", 'P'^C'^) G S^q(u,0 ®C,i,^(^u^q by right invariance and 
the definition of 7q whereby du^qiSU) G B^^^p^ ®^iq{U)- 
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For any given we now define a Riemannian metric (•, •)y on U(m) © U(n) at the point T = 
{V, W) as follows. For any <5Ti, JTa € TT(U(m) © U(n)), let 

{5TuSr2h ■■ = liSTf, 5T^>jjg + (dT'l'T(5Tf + 5Tf), dx'&T(<5Tf + 5Tg))jjg (8a) 
= ^(5Tl^,5T^>Hs + 2(^Tf,5Tf)jjg + (5Tf,5Tf>^g, (8b) 

where we have used the orthogonal decomposition 6T = 5T"^ + 5T^ + dT^ of 5T into the three 
orthogonal subspaces given by A, B, and C. Then for any U £ Q and ( G U(K), 

= lilQiU) ■ {6Ci,'J'^6Ci'J'), 7q(C/) • (<5C2,yt5C2y))HS 

= <<5Cl,'^C2>HS + {d^Q^u)^7{dulQ{SUi)), d^^(f;)4>y(d{77Q((5[/2))>Hg (9b) 

= {SCi,K2)us + {SUi,6U2)^s, (9c) 

where the last step follows from the fact that $y o is the identity map on Q. So, if we endow 
U(K) C C^^^ with the Riemannian metric induced by the real Hilbert-Schmidt inner product on 
^NxN ^ then is an isometric diffeomorphism between Q © U(K) and <ly^(Q) C U(m) © U(n). 
It then follows that VoIhs(Q) VoIhs (U(K)) = Voly (l>-^(Q)), and therefore 

, Voly (U(m) © U(n)) , , 

VoIhs OrbCP ) = ,\ 10 

^ ^ VoIhs (U(K)) ^ ' 

so the problem reduces to computing the ratio of the volumes of the two Lie groups with respect 
to the indicated geometries. 

The expression in (10) may be simplified further by considering the volume form on U(m)©U(n) 
induced by the metric (•, •)g> [^'^]- This volume form can be realized by choosing orthonormal bases 
of A, B, and C at (1,1) in the metric and extending them to orthonormal vector fields by 

left translation. Denote these fields by {Fi} where i = 1, . . . ,d where d = + S '^i • Then 

construct the dual basis of 1-forms oji = {Fi, •)y and the volume form by A • • • A w^^. Let 
K = Yli^lj ~ II-^IIhS' Because of the relationship between and the Hilbert-Schmidt metric 

described in (8), if Fi, . . . , is the basis for A and F^+i, . . . , F2k is the basis for B under (•, •)g>, then 
^Fi, . . . , and V2F«+i, . . . , V2F2 K are the corresponding orthonormal vector fields under the 

Hilbert-Schmidt metric. So \/2a;i, . . . , v^Wk and ;^a;K+i, • • • , :^'^2k are the corresponding 1-forms 
under the Hilbert-Schmidt metric. As a result, the volume form w on U(m) © U(n) induced by the 
Hilbert-Schmidt metric is identical to w, the volume form induced by (•, •)j>: 

Cj = V^uji A - ■ ■ A V2uJi^ A -^Wk+i a • • • a A uj2k+i A ■ ■ ■ AuJd = oj. (11) 
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So, while (•,•)? defines a diff'erent geometry on U(m) © U(n) compared to the Hilbert-Schmidt 
metric, stretching some dimensions and shrinking others, the result is no net difference in the 
volume form and therefore no difference in the volume under these two geometries. So we may 
replace (12) with the expression 

, VoIhs (U(m) © U(n)) , ^ 

VoIhs ( Orb ? ) = ^ \ \ , 12 

^ ^ ^' VoIhs (U(K)) ^ ^ 

which reduces the problem to one of computing the volumes of U(m) ® U(n) and U(K), both just 
direct sums of unitary groups, under the Hilbert-Schmidt metric. 

Lemma 3 (The Volume of U(a)). Let a € N'' 6e any vector of non-negative integers. Then the 
volume o/ U(a) = U(ai) © • • • © U(afe) with respect to the Riemannian metric induced by the real 
Hilbert-Schmidt inner product is given by 

VoIhs (U(a)) = ^ ^ . (13) 

nin.Lo 

Proof. To apply Macdonald's formula for the volume of a Lie group [13, 14], we need the basis for 
the Chevalley lattice given by the vectors {tj}, {£,a}, and {rja} in Appendix A. These vectors are 
all mutually orthogonal in the real Hilbert-Schmidt inner product on U(a). In addition, the r^'s 
have norm 1, and the ^^'s and rjaS have norm \/2. Letting = ^0^, the Gram matrix of this 
basis is then a diagonal matrix with a entries equal to one and a/(a« — 1) entries equal to two, 
so that the volume of the fundamental cell is just 

A(fl/gz) = 2I '^'('^'-1) = 2i '^^-i . (14) 

Then Macdonald's formula gives the volume of U(K) as 

a; — 1 

VoIhs (U(a)) = 2^E'"M J] [] Vol (S^^+i) (15a) 

n 1 (15b) 

n VoIhs (U(aO). (15c) 
I 

□ 



(27r)lE'^i-2 



I s=0 

2 I a 
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With this lemma, we can now compute the volmne of the critical submanifold Orb(IP): 

VolHs(U(m) e U(n)) _ H VoIrs (UK)) VoIhs (U(mj)) 



VolHs(Orb(a')) 



VoIhs (U(K)) 

(2vr)5E"f+|E-2+7V 



n„ VolHS (U(%)) 



r=0 



(2vr) 



2 ^'^ij 



JV 
' 2 



n,,np=Vp!n,nr=o"^! 



(27r)^ 



rm, — 1 , ' 



n.n;roV!n,n;':fo"^9! 

where d = ^ + XI ~ X] ^i^i dimension of Orb(J'). 



(16a) 
(16b) 
(16c) 



3.1. Examples. 



Example 1 (Maximum Submanifold of Pi^j). For any two (non-zero) vectors \i) and \ f) in 
let p = and O = jj^- Then J{U) = Ti:{UpU^O) represents the transition probability from 
initial state \i) to final state \ f). Translating the problem by diagonalizing p and O and sorting 
the eigenvalues in decreasing order, it is found that p = O has a single "1" in the (1,1) element 
and zero elsewhere. The N'^ — 2N + 2 dimensional maximum submanifold then corresponds to the 
identity permutation and yields the following contingency table [ ] and volume: 





mi 


= 1 


m2 


= N - 


1 


Ui 


= 1 


hi 


= 1 


h2 


= 




n2 


= iV- 1 


hi 


= 


h2 


= N- 


1 



VoIhs Orb (I) 



(2vr) 



|(Af2-Ar+2) 



nN-2 
P=0 P 



(17) 



Example 2 (Minimum Submanifold of Pi-^f) 

that fails to align the non-zero eigenvalues of p and O, then Orb(T) is the N"^ 
minimum submanifold, with contingency table and volume: 



If, in the previous example, a permutation is used 

2 dimensional 





mi 


= 1 


m-2 


= N- 


1 


ni 


= 1 


hi 


= 


h2 


= 1 




n2 


= N-1 


hi 


= 1 


h2 


= N- 


2 



VoIhs Orb(y) 



(27r)i(^'+^-2) 



(18) 



Example 3 (Fully Non-degenerate p and O). In the case where p and O are both fully non- 
degenerate, mi = ■■■ = m^r = 1, m = ■ ■ ■ = UN = 1 and K = CP^^. Then for any critical 
submanifold Orb(J'), all of which are N dimensional tori, 

VolHsOrb(T) = (27r)^. (19) 



4. The Measure of the Near-Critical Set 

In this section, an estimate is derived for the measure of the near-critical set = {U : 
II grad J(C/)|| < e} C U(A^) surrounding the critical submanifold Orb(CP). To do that, we first 
approximate the near-critical set by an ellipsoidal tube about Orb(y). 
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4.1. Approximating the Set of Interest. Let U S Orb(lP) be a critical point of J, and let 
X G u{N) be such that ||X|| = 1 and UX G (T^/ Crit( J))"^, i.e. UX E Tu\J{N) is orthogonal to 
the null space of Hessj^c/ : Tu\J{N) — )• T[/U(A^) (the Hessian operator of J at the point U G U(A^)) 
since J is Morse-Bott, the necessary properties having been established in [ ]. Define Fx : M — )• M 
by 

Fx{s) = \\ gvad J {Uexp{sX))f. (20) 

Then Fx{0) = and 

^ = 2(grad J(C/exp(sX)), Hessj,c/exp(.x) (C/exp(sX)X)) (21) 
so that (dFx/ds)(0) = 0, since U £ Crit(J). Furthermore, 
d'Fx 



ds2 



2( Hessj^[/exp(sX) (U exp{sX)X), B.ess j^u eM^x) {Uexp{sX)X) 



+ 2/ gradJ(C/ exp(sX)), V[7exp{sX)x Hessj^,7cxp(sX) (f/ exp(sX)X) \ (22 



sothat (d2Fx/ds2)(0) = 2\\Ress,iu{UX)f . Thus, for small s, Fx (s) = s^\\Ressj^u{UX)f + (D{s^). 
Then in order to have || grad J(C/ exp(sX)) || < e, we should have Fx{s) < e^, and therefore 

Now, suppose that {UYi} are the orthonormal eigenvectors of Hessj^f/ corresponding to non- 
zero eigenvalues {/3j} (see Appendix B). Then each normalized UX G (T(/ Crit( J))"*" is such that 
X can be written X = Y^OiYi with Yl'^l ~ 1- Then the condition on s is that < ^%^, , 
i.e. X](sai)^/^j^ ^ so that (sai, . . . ,sam) is a point in the m-dimensional solid ellipsoid with 
principal axes {e/|/3i|}. Therefore sX lies in the m-dimensional solid ellipsoid with principal axes 
{(e/|/3i|)yi}, and the set of all ?7exp(sA) for sX in this ellipsoid is an m-dimensional geodesic 
ellipsoid in U(A^), which we will denote by ^^(^7). Repeating this analysis at every point U of 
the critical submanifold Orb(J') and drawing together the resulting geodesic ellipsoids yields an 
ellipsoidal tube Te = U(;gOrb(T)^e(f^) about Orb(!P) that approximates the set of points for which 
II grad J|| < e. It is this tube of near-critical points whose volume we will estimate. Figure 1 offers 
one simple example of such an ellipsoidal tube about the submanifold of M^. 



4.2. Volumes of Tubes. The study of the volumes of tubes goes back to 1939, when Hermann 
Weyl [15] gave the first complete description of the volumes of spherical tubes about submanifolds 
of Euclidean and spherical spaces. These volumes were presented as a finite power series in the 
radius of tube, with coefficients derived from the geometry of the submanfold. This beautiful result, 
frequently referred to as Weyl's tube formula, has been successfully extended to certain other very 
special spaces (e.g. projective spaces), but appears not to have been extended to Lie groups or to 
the unitary group in particular. However, we may still make use of an infinite power series [16] to 
approximate the desired volume. 
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Figure 1. Schematic rendering of a simple example of an ellipsoidal tube about 
a circular 1-dimensional submanifold of M^. The ellipsoidal tubes considered in 
this paper are of a similar nature, but more complex, being tubes about higher 
dimensional (and more topologically and geometrically interesting) submanifolds of 
U(iV). 

We will closely follow the notation, definitions, and conventions in [16]. Let exp^^ : v — )• U(A^) 
denote the Riemannian exponential map from the normal bundle v of Orb(IP) to U(A^), i.e. exp^ 
takes a point (f7, V) ^ u with V G (Tu Orb(J'))''~, and outputs the point in U(A^) found by following 
the constant speed geodesic ^ with ^(0) = U and ^'(0) = V out to ^(1). Let yi, . . . ,yd he a local 
coordinate system for Orb(J'). Let xi, . . . ,X]\i2 be the Fermi coordinates [16] on U(A^) generated 
by yi,...,yd and the orthonormal fields E^+i, ■ ■ ■ , E]\f2 on Orb(J') defined as the orthonormal 
eigenvectors of the Hessian of J corresponding to non-zero eigenvalues, as in (51), 



N- 



Xi 



j=d+l 



Note that the vector fields {Ei} track the principal axes of the ellipsoid normal to Orb(!P). For 
each i G {1,...,A^^}, let Xi denote the vector field The geodesic ellipsoidal tubular shell 

characterized by the "radius" r, denoted <£r, is described in this local coordinate chart by the set of 
points for which YliLd+i ^fl^i ~ ^ ^ X(U(A^) — Orb(3')) denote the (outward) unit normal 

vector field to the shells (Br, let L denote the diff'erential 1-form L = {L, •), and let /x denote the 
volume form of Then L A /u = w, where u is the volume form on U(A^). Likewise let Ly be the 
unit normal vector field to the e-ellipsoid in (Tu Orb(T))''' and let Li, be the 1-form L^, = {L^, •). 
Let flu denote the volume form of the ellipsoidal shell in (Tu Orb(J')), so that A fiu = where 
UJu is the volume form on the normal bundle i'. 

We would like to compare exp*(I/), the pullback of L to u, with Li,. To that end, observe that 
at a point V = tiEi in the normal space (T[/ Orb(J'))''~, the tangent space to the ellipsoidal 
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shell through V is the set {X]£d+i where Yl!i=d+i'^i^iP'i ~ ^- Then the unit normal is 
L^{U,V) = ' G (Tc7 0rb(y)) . At the corresponding point W = exp^(C/,y) G U(iV), 

the tangent space to the geodesic ellipsoid is the set 



(24) 



where Ylf=d+i'^i^if^i ~ Then L(W) is the unit vector in TvyU(A^) normal to this space, i.e., 
Yli=d+i'^i{^^^i) = {'^i} s^ch that YliLd+i'^i'^if^i = 0- This implies that {L,Xi) = ctij3f 

for all i G {d + 1, . . . , A^^} and some fixed normalization constant c. Let Z G (T;/ Orb(J')) 
be perpendicular to L [i.e., tangent to the ellipsoid through {U^V)]. Then by (24) dexpj^(Z) 
is tangent to the geodesic ellipsoid through W, so that ((dexp^)*(L), Z) = (L,dexp^(Z)) = 0, 
meaning that (dexp^)*(L) = cLy, where it can be shown that c = ||(dexp^)*(L)|| = 1 + O(e^). 
Hence = (1 + 0{e^)) exp* (L). 

With the above material in mind, and taking (W{U) to be the volume measure of the ellipsoid 
within (T^ Orb(IP)) and dP to be the volume measure of Orb(!P), we find that the pull-back of 
uj is [16] 

exp*(L) A exp*,(/x) = exp:(c^)(C/, F) = a;(Xi, . . . , ^^2) ( exp,(C/, V))uj,{U, V) 

= Lo{Xi, Xjv2) (exp,(C/, V))L, A ^,(C/, V) (25a) 
= a;(Xi,...,X^2)(exp^(C/,y))L^ Ady AdP (25b) 
= (1 + 0{e^))u;{Xi, . . . ,X^2)(exp^(C/,y)) exp*(L) A dF A dP (25c) 

and therefore 

exp:(^) = (l + 0(e2))L^(Xi,...,X^2)(exp,(t/,y))dyAdP (26) 
Now, it was shown in [16] that 

ijj{Xi, . . . , X^2) = 1 — {H, Xi)xi + higher order terms, (27) 

i=d+l 



where H is a, section of the normal bundle v over Orb(IP) called the mean curvature field. The first 
order term in the above expression for cj will integrate to zero (as will all odd order terms) due to 
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the symmetry of the elhpse. Then, the area of the geodesic eUipsoidal shell C;^ is given by 

Area(ge) = / d/x = / exp*(d//) (28a) 
= [ [ {l + 0{€fMXi,...,XN2)dV AdP (28b) 

JOih{T) JEllipsc(e) 

= (l + 0(e2))e^'-'^-i / / u;iXi,...,X^2)dV AdP (28c) 

yOrb(y) JEllipsc(l) 

= e^'-'^-i VoIhs ( Orb(y)) Vol(Ellipse(l)) + 0{e^'-'^+^) (28d) 



where Ellipse(r) is the ellipse with principal axes r/\f3i\. 

So we conclude that for small enough e > 0, the volume of the ellipsoidal tube about Orb(J') is 

Vol(Te) = / Area(^,.)dr (29a) 







VoIhs ( Orb(y)) Vol(Ellipse(l)) + 0(e^ -'^^^) (29b) 



Hj llr=0 '■ ^/ /v2-d+2 



r(^ + i)niAi n.np=Vp!n,nrio"^' 



— + 0(6^-^+2), (29c) 



where d = "^nf + "^j S ^i^j — dim ( Orb(y)) . Then the volume fraction of the tube within U(A^) 



is 



VolFrac(T,) = = VolCTJ (30a) 

VoIhs (U(iV)j (27r)^-^ 

2^r(fc^ + i)niAi n,,n;=oV!n,n5o ?! 



4.3. Examples. We now return to the examples considered in Section 3.1 and compute the volume 
fractions of the corresponding near-critical sets. 

Example 4 (Maximum Submanifold of Pi^j). The Hessian of Pi^f on the maximum submanifold 
has rank 2N — 2, and all of the nonzero eigenvalues are Pi = —1. Then, the volume o/Ellipse(l) 
with principal axes is just the volume of the unit sphere S"^^^"^, which is {2N — 1) '^2N^iy.\ ~ 
(■2JV-2)! (^^^ double factorial {2N — 1)!! is defined as the product of the odd integers from 
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1 to 2N — 1). Using the volume computed in Section 3.1 for Orb(I), we find 

V„lFrac(T.) = ^ VoIhs Orb(n) yol(Empse(l)) 

^ ' 2N-2 VolHs(U(iV)) ^ ' ^ ^ 

2N-2 n^To'p! (27r)^ ^ ^ ^ ^ 

^^"7,;i^'i)^^^ o(.^-) (310 



where 



for N >2. 



1 (iV-2)! /2\^~3 1 , , 



2A^-3 (2A^-3)!! - V3/ 2iV - 3 



Example 5 (Minimum Submanifold of Pi^j). The Hessian of Pi^f on the minimum suhmanifold 
has rank 2, with both nonzero eigenvalues equal to one. So the volume of Ellipse(l) is just the 
volume of the unit sphere S'^, which is Att. Using the volume computed in Section 3.1 for Orb(lP), 
we find 

= ^VolHsOrb(J^)Vol(Empse(l)) 
^ ' 2 VolHs(U(iV)) ^ ^ ^ ' 

= t — AT^ (4vr) + 0(6^) (33b) 

2(iV-2)!nSp! (2^)^ 

= (TV- l)e2 + 0(e^). (33c) 

Example 6 (Fully Non-degenerate p and O). In the case where p and O are both fully non- 
degenerate, mi = ••• = TTiAT = 1, ni = ■ ■ ■ = = 1 and K = [P"^. Then for any critical 
submanifold Orb(J'), all of which are N dimensional tori, 

V„,F,.ac(T.) = V°lHsOrb(:P)Vol(Empse(l)) 

^ iV2_^ VolHs(U(iV)) ^ ^ ^ ^ 

nN-l I 



2^(^)!niA 
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where H I A I ^he product of the iV^ — N nonzero Hessian eigenvalues, and where 

< ...n.-°" . <^ — (35) 



5. Asymptotic Analysis 

The expression in (30b) provides a means of estimating the volume fraction of any given near- 
critical set. However, for probing the general asympotic behavior of these volumes as the dimension 
N of the state space rises, this estimate is inadequate since it only holds for small enough e > 0, 
where "small enough" is determined for each system and each dimension A^. Before seeking a new 
expression of practical utility, it is necessary to define the parameters of the desired asymptotic 
analysis. Fix some Nq > 0, Nq x Nq density matrix p", and Nq x Nq Hermitian observable operator 
0°. Then for any z G N, let = Nq + z, p"" = ® 0^, and = 0° 0;^, where 0^ is the z x z 
zero matrix. Then each critical value of the kinematic landscape Jz{U) = Ti{U p^U'^O^) is also a 
critical value of Jz+i, so each critical submanifold of Jz has a direct analog in Jz+i- In this fashion, 
one can decribe an infinite sequence of critical submanifolds as z — )• oo and consider the asymptotic 
behavior of the volume fractions of the near-critical sets around these critical submanifolds. It will 
be argued that these volume fractions converge to zero as z — )• cxd. The landscape Jz for z > Nq 
has the same number of critical submanifolds as Jn^ , so if the volume fractions of the individual 
near-critical sets converge to zero, then the total near-critical volume fraction of Jz also converges 
to zero as z — )• oo. 

Because compact Lie groups with bi-invariant metrics have non-negative sectional curvature 
[17], the following comparison theorem proved in [.i', Ch. 8] may be used to bound the volume of 
spherical tubes about a submanifold P. 

Theorem 1. Let M be an n-dimensional Riemannian manifold with non-negative sectional curva- 
ture. Then for any d-dimensional submanifold P C M and all r > 0, the volume of the spherical 
tube of radius r about P in M is bounded as 

d 



v^{r) < r [ [ t 

Jo JpJs"-'^-^ 
where H is the mean curvature vector field. 




Since the submanifolds of U(A) considered in this paper are all minimal (H = 0), this bound 
reduces to 

N'^—d 

yZ^r) < ^^Vol{S^'-'-^)Yol{OM^)). (37) 

Of course, any ellipsoidal tube with longest principal axis ^^/|/5min| is contained in the spherical 
tube with radius r = e/|/3min|- In the sequence of density matrices and observable operators p^ 
and described above, once z > Nq, the set of distinct Hessian eigenvectors for a given critical 
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submanifold does not change with z. Then /3min is fixed and the e-elUpsoidal tube about the critical 
submanifold is contained in the spherical tube with radius r = e/|/3niin|, so the volume of the 
e-ellipsoidal tube is bounded by 

N'^-d 

- (iV^-rf)|/3n.nr^-'^ ^°^^'^'' -'^-^) Vol(Orb(T)), (38) 



so that 



e 



Vol Vol(Orb(y)) 



(39b) 



where d = m? + X] "-j ~ X] ^fj dimension of the critical submanifold. 

Now, for z > Nq, the set of critical values does not change with z, so fix a critical value v 
and consider the contingency table associated with the corresponding critical submanifold. As 
z increases, the only elements in the table that will change are nis, the multiplicity of the zero 
eigenvalue of p, Ur, the multiplicity of the zero eigenvalue of O, and kgr, the degree of overlap 
between the zero eig envalues of A and T^ET, where A = Qp^^ and E = TCTt are diagonalizations 
of p and O with decreasing elements. For z > A'o, these indices change as ms{z + 1) = ms{z) + 1, 
nr{z + 1) = nr{z) + 1, and ksr{z + 1) = ksr{z) + 1. Then d[z + 1) = d{z) + 2{ms{z) + nr{z) — 
ksr{z)) + 1 = d{z) + 2(m,(iVo) + UriNo) - ksr{No)) + 2{z - No) + 1. So iVf_^i - d{z + 1) = 
iV2 _ d{z) + 2(2iVo - ms{No) - ^^(iVo) + ksriNo)). Let D%e) denote the right-hand side of (39b) 
for z > Nq. Then let 



( Nl-d{z)\ 



NJk, 



2 



/ 4jVg-d(jVo) 



+ (z-iVo)c)! NJksAzV. , 



2C|/3^i,|2C |- 4jVg-d(jVo) + + 1 _ n,(z)!m,(z)! 

where C = 2A'o — uisiNo) — nr(No) + ksr{No) > since if Vp and i^o are the nullities of and O", 
then ms(A'o) = A^'o + fp, rir^No) = Nq + vq, and ksr{No) > f p + z^o. Moreover, note that if C = 0, 
then ksriNo) = Vp + fo and the critical value under consideration must be u = 0, so for any critical 
value V / 0, C > 0. Let 

rz_ F^i^) _ [ ' +{z-l-No)C)l ^N^ + ,^^ksr{No)-No + z) 

F-\e) |^ 4jVg-dW.) + + 1 _ jVo)^) ! {rirjNo) - No + z){ms{No) -No + z)' ^ ^ 

where 

(A'o + ^)(/^sr(A'o) - A^o + z) _i , ^1 , r^f.-2 



nr{No) -No + z){ms{No) - No + z) z 



l + C- + 0{z-'). (42) 
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Then whenever C > 0; the expression in (42) converges to 1 for large z, so that \ as z — )• oo. 
Therefore F^{e) \ and consequently D^{e) \ as 2 — )• 00. Moreover, it may be seen that 
= 0(z-2C), so that F%e) = 0((z!)-2C) and D^{e) = OiUlZlisl)'^'^)- So the volume fraction of 
the spherical tube of radius e/|/3min| converges to zero as 2; — )• 00 and finally we may conclude that 
the volume fraction of the ellipsoidal tube approximating the e near-critical set about a critical 
submanifold with critical value v ^ also converges to zero as z — )• 00. Since this convergence 
proceeds very quickly as the negative power of a product of factorials, it is independent of the 
slower exponential contribution from and therefore independent of the choice of e. 

This last result demonstrates the convergence of the volume fractions of the approximating 
ellipsoidal tubes. To improve on this and show the convergence of the volume fractions of the 
near-critical sets themselves, it will be necessary to make use of the following conjecture. Evidence 
in support of the conjecture is presented in Appendix D. 

Conjecture 1. Let U G U(A^) be a critical point of J{U) = Tr{U pW O) , and let A S u(n) be such 
that the tangent vector UA £ Tjj\J{N) is of unit length and lies normal to the critical submanifold 
through U. Define f : [0,7r/(2V2)] ^ R to be the norm squared of the gradient of J along the unit 
speed geodesic in the direction UA, i.e. 

f{s) = II grad J(?7exp( sA))f = ||[exp(-s^)C/to;7exp(sA),p]||^ = \\[U^OU, exp(syl)/) exp(— sA)] , 

(43) 

and let /3mm be the minimum (in absolute value) nonzero eigenvalue of the Hessian of J at U . Then 
f{s) > (3l,^sm^V2s)/2 for all < s < 7r/(2V2). 

With this conjecture, once z > Nq, the set of Hessian eigenvalues no longer changes with z, so 
there is one fixed /3^in for all z > Nq. Then for any e < |/3min|/\/2) when r = e7r/(2|/3min|) < 
7r/(2V2), it is found that /(r) > /3l^-^sm'^{V2r)/2 > 4:/3l^i^r^ / tt'^ = e^, so that the e near-critical set 
about the critical submanifold is contained with the radius r spherical tube. For any fixed radius, 
the volume fraction of this spherical tube was shown to converge to zero as z — )• c« when the 
critical value v ^ 0. So, it is seen that if Conjecture 1 holds, then the volume fraction converges to 
zero (as the negative power of a product of factorials) for the e near-critical set about any critical 
submanifold with critical value v ^ 0. Since saddles are non-attractive critical points, this suggests 
that as z gets large, the probability becomes vanishingly small that the gradient flow from a (Haar 
distributed) random point passes through one of these flat "near-critical" regions around a saddle 
submanifold. 

Referring back to the examples from Sections 3.1 and 4.3, it may be seen that, since the transition 
probability Pi^f involves a rank one density matrix p and observable operator O, there exist 
sequences of these landscapes (for increasing N) that fit the required behavior for p'^ and 
needed for the analysis in this section. Of the two critical submanifolds of this landscape, only the 
maximum submanifold Pi^f = 1 satisfies the further condition that ( be nonzero. The remaining 
example of non-degenerate p and O for every N does not adhere to these requirements and falls 
outside this analysis. Ultimately, the asymptotic analysis of such an example in which p^ and 
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are fully non-degenerate for all N would be sensitive to the asymptotic behavior of the eigenvalues 



This work computed the volumes of the critical submanifolds of J{U) = Tr^UpWO) in the 
induced Hilbert-Schmidt measure, and developed estimates and bounds for the volume fractions 
of near-critical sets of the form = {U : \\ grad J(C/)|| < e} C U(A^). An asymptotic analysis 
of these volume fractions revealed that, when the critical value is non-zero, the volume fraction 
converges to zero as N ^ oo. This result helps to explain previous observations that numerical 
quantum optimal control experiments seem not to be adversely affected by the presence of a large 
number of high-dimensional saddle submanifolds. 

The work presented here focussed on the geometry of the kinematic landscape J : U(A^) — )• 
M. Although it is outside the scope of this paper, to relate this work more closely to numerical 
and laboratory quantum optimal control experiments, these results should be pulled back to the 
corresponding dynamical landscape J : K — t- M defined on the space of controls. This effort must 
address a number of difficulties including the dependence of these landscapes on the details of the 
quantum system and, depending on definitions, dependence on the final time T. But perhaps the 
biggest problem is that, in order to make mathematical sense of the concept of volume fractions, 
a probability measure needs to be defined either explicitly or implicitly on the control space K, 
which is typically infinite dimensional and unbounded, such as K = L'^{[0, T]; M). Overcoming these 
difficulties could provide a clearer picture of the gradient flow of J on K. 
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The first step to evaluating the volume of a compact Lie group G via Macdonald's formula [13, 14] 
is to work out it's Chevalley lattice (closely related to the concept of the Chevalley basis [ ]). 
In this appendix, we describe this lattice for groups of the form G = U(ai) © • • • © U(ab) as we will 
need to compute volumes of two such Lie groups to obtain the desired structure of the critical set 
Orb(iP). To that end, first observe that the set of all diagonal matrices in G forms a maximal abelian 
torus T, the Lie algebra of which, t, is the set of all diagonal matrices in g = u(ai) © • • • © n{ai,). 
Then defining iz such that 27rtz is the kernel of exp : t — )• T, it is found that iz = idiag(Z'*) is the 
lattice of diagonal a x a matrices with imaginary integer elements, where a = ^aj. 
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Now, each u(a;) has complexification gl(a/) which is the direct sum of sl(a/) and a one-dimensional 
abehan algebra. So the root vectors of gl{ai) are just those of sl(a/). The root space decomposition 
of sl(a/) is defined by the positive roots a\, {I < j < k < ai) and the corresponding coroots H^i 
and root vectors X^i [ ] given by 

jk 

a]j,{H) = - l<j<k<ai (44a) 

H^^.=\3){3\-\k){k\ (44b) 

jk 

X, = \j){k\ (44c) 

jk 

X,=-\k){j\. (44d) 

jk 

where H is an arbitrary element of the Cartan subalgebra of diagonal matrices in sl(a;). If, for any 
X = ^ + iri ^ s\{ai) with ^, r/ S su(ai), we define X = ^ — ir], then the Xa's above satisfy X^a = Xa 
as indicated in [ ]. Then finally let 

^k ■= (Ol.;;^^'^"^^^-"^.'^!-^) = (0,...,0,|j)(A:| -|fc)(j|,0,...,0) (45a) 

/—I zeros b—l zeros 

r], := {0,...,0,i{X, -X,),0,...,0) = {O, . . . ,0,i{\j){k\ + \k){j\) ,0, . . . ,0) (45b) 

J k J k J k 

for all 1 < / < 6 and all 1 < j < /c < a^. These vectors, along with the basis of given by 
{tj := for j = 1, . . . ,a, form the basis for the Chevalley lattice of U(ai) ® • • • © U(ab) 

denoted by Qz in [f3]. The volume of the fundamental cell g/gz with respect to a given inner 
product is just the square root of the determinant of the Gram matrix constructed from the basis 
for Qz- 



Appendix B. The Hessian of J{U) = Tr{UpU''0) 

In this appendix, the Hessian operator of J is described and its eigenvalues and eigenvectors 
obtained. First, note that, since U^U = I, 6U^U + U^6U = for any 6U E TuU{N), so 6U^ = 
-U^6UUK Let J : U(iV) ^ M be given by J{U) = TiiUpU^O). Then 

duJ{dU) = RTiidUpU^O + Up6U^O) = RTviSUpU^O - UpU^dUU^O) (46a) 
= ^Ti{{p,U^OU]uUu) = {U[U^OU,p],5U) (46b) 

so that grad J(C/) = U[U'^OU, p] G Tu\J{N) and grad J is a vector field over U(A^), i.e. grad J G 
X(U(iV)) [the C°° module of smooth vector fields on U(iV)]. For any X £ X(U(iV)), the Hessian 
of J is given by Hess(X) = Vx grad J. Because U(A^) is endowed with the Riemannian metric 
induced from the real Hilbert-Schmidt inner product on C^^^,Vx grad J = (d(grad J)(X)) [17], 
where X is an extension of X to C.^^^ and T denotes the tangential part. Now, 

du grad J{5U) = 6U[U^OU, p] + U[6U^OU, p] + U[U^06U, p] (47) 
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and therefore, 

Hess(X) =VxgradJ= (u^ X[U^OU, p] + [U^OU, p]X^U + 2[X^OU, p] + 2[U^OX, p]^ (48a) 
^-u(u^X[U^OU,p] - [U^OU,p]U^X + 2[X^UU^OU,p] + 2[U^OUU^X,p]) (48b) 



2 

lu(u^X[U^OU,p] - [U^OU,p]U^X - 2[U^XU^OU,p] + 2[U^OUU^X,p] 



2 

= u(^^[U^X,[U^OU,p]] + [[U^OU,U^X],p]^ . (48d) 

Any Hessian may be written as {Y, Hess(X)) = XYJ — (yxy)J- Note that XYJ is second order 
in J and first order in the manifold (i.e., relates to tangent spaces), while (VxY^J is first order in 
J and second order in the manifold (i.e., relates to cm'vature of the manifold), so this second term 
vanishes at critical points and on flat spaces (when X and Y are constant vector fields). In the 
above case, for left invariant X and Y, it can be shown that XYJ = (Y, U[[WOU, U^X],p]'j and 

{VxY)j = - (y, lu[Wx, [Wou,p]]). 

Suppose that f/ is a critical point of J, so that grad J(C/) = U[WOU, p] = and Hess(X) = 
U[[U^OU,U^ X], p]. Let Q diagonalize p and F diagonalize O such that the diagonal elements of 
A = npn^ and S = TOT^ are decreasing. Then U = T^VTW^n for some {V, W) £ U(m) U(n) 
and some T £ Sn ["]. We can then write 

Hess([/y) = u[[n^w?''j:'Pwn,Y],n^An] = u n^w T,'j',w'<nYn'<w],A]nwK (49) 

Then Ress{UY) = I3UY if and only if 

/3y = [[ytsy,y],A] (50a) 

PYjk = -(Aj - Afc)(<TyQ) - (Tg>(fc))yjfc (50b) 

where Y := l^^^Jiyr^^'Ty. So either /3 = —{Xj — Afc)(cjy(j) — o"y(jt)) or Yjk = 0. Then an orthonormal 
set of solutions is given by 

P = Y = i\l){l\ (51a) 

P = -(A,- - A,)(ay(,) - ay(fc)) Y = ^{\j){k\ + |A:)(j|) (51b) 

(3 = -{Xj - A,)(a:p(,) - ay(fc)) Y = -^{\j){k\ - \k){j\) (51c) 

for 1 < Z < and I < j < k < N. These (3 and the corresponding vectors UY = UO.'^WYW^Q 
are the eigenvalues and eigenvectors of Hess at the critical point U. 
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Appendix C. The Shape Operators of the Critical Submanifolds 

Consider a critical submanifold for some permutation matrix T and degeneracy structures m 
and n of O and p, respectively. Then, let Q G U(A^) and T G U(A^) diagonalize p and O, so that 
Q,pQ^ = A and FOF^ with Ai > • • • > Aat and ci > • • • > utv- The critical submanifold is then 
C = rtU(n)TU(m)Q C U(A^). At any U = T^VJ'W'fn in this critical submanifold, the Hessian 
eigenvectors identified in (51) corresponding to zero eigenvalue describe an orthonormal basis for 
the tangent space of C, TjjC, and the eigenvectors corresponding to nonzero eigenvalues form an 
orthonormal basis for the normal space of C at U, (Tf/C)"*". While such a critical submanifold 
C is not totally geodesic in general, it is the case that the geodesies along these particular basis 
directions for TjjC remain on C. To see this, first note that each of these basis vectors X has 
the property that either [?7^X, U^OU] = or [WX, p] = 0. This follows from the fact that, when 
X = W'fnWxn^W] G u(A^), either X = i\l){l\ for some / = 1, . . . , so that X commutes with both 
WOU and p, or A = z\j){k\ — z\k){j\ for some \z\'^ = 1/2 and j < k with either Xj = or = 
(Ty(fc). When U'lX commutes with U^OU, then TAyt commutes with S, so that TXT'i G u(m). 
Then the curve 7(5) = e^^^^'U = ^We^'^^'^'^'' 'PW'^^l is a geodesic in C in the direction 7'(0) = X, 
which can be thought of in terms of the path through U(m) © U(n), {V{s),W{s)) = {Ve^'^'^'^\w). 
Likewise, when WX commutes with p, then X commutes with A, so that X G u(n). Then the 
curve 7(5) = Ue'^^'''^ = F^VPe'^^W^Q is a geodesic in C in the direction 7'(0) = X, which can 
be thought of in terms of the path through U(m) © U(n), {V{s),W{s)) = (V, We~^^). Note that, 
while the geodesic curve is uniquely defined for a given basis vector A, the corresponding path 
through U(m) ©U(n) is not unique because of the ambiguity in associating {V, W) G U(m) ©U(n) 
to a given U £ C afforded by the stabilizer subgroup Stabu(m)eu(n)(^) — U(K). 

For any two of these basis vectors for TjjC, say A and Y, the second fundamental form ^(A, Y) 
at U is defined to be (Vj^l")^, where A and Y are smooth local extensions of A and Y. This can 
also be calculated by extending Y along the geodesic in the direction of A and taking the normal 
part of the covariant derivative of this field. Letting Y = W^QWYQ^W G u(A), when U^X 
commutes with WOU, the vector Y can be extended along the geodesic 7(5) = e^'^'^^'U by Y{s) = 
-f{s)n^WYW''n = rWe'^^^^^TYW^n = T^VPe'^YW^'n where the path through U(m) © U(n) 
{V{s), W{s)) = {Ve^'^'^'^\ W) is used to define the extension. Likewise, when U^X commutes with 
p, the vector Y can be extended along the geodesic 7(5) = Ue'^'^'''^ by Y{s) = T'^VI'Ye'^'^W^Q, 
by using {V{s),W{s)) = {V,We~''^). Since the paths through U(m) © U(n) are not uniquely 
defined, these extensions of Y along 7 are not unique. Indeed, this is clear in the cases where U^X 
commutes with both WOU and p, as the two extensions of Y described here are different for any 
Y such that A] 7^ 0. However, since the second fundamental form is tensorial [iu], the choice 
of extension of Y will ultimately not be important. 

The covariant derivative of Y{s) can be computed simply as the ordinary derivative of Y (as a 
path through C^^^) followed by projection down to the tangent space T^(g)U(A^). Then, using 

to denote the tangential component, the covariant derivatives for the particular extensions Y{s) 
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defined above are 



TW'J'XYW'<nj for [C/tX, U'fOU] = 

s=o/ I l^rW^YXw^ny for [Wx,p] = o 
I mv'J'Yxw^n - c/f]t^txy?tytrc/j for [Wx, p]=o 

Ir^VPiX, Y]W^n for [U^X, U^OU] = 
-^rW'?[X,Y]W^n for [?7tX,/9] = 0. 



(52a) 

(52b) 
(52c) 



Then S{X, Y) = (Vj^y)"*" is the normal component, where the normal space (T^/C)"*" is spanned by 
the vectors Z = F^VPZW^Q where Z = z\j){k\ — z\k){j\ such that |zp = 1/2 and j and k are such 
that Xj 7^ Xk and 7^ <7y(fc)- Now, observe that if X = and Y = i\m){m\, then [X,y] = 0. 

And if X = i\l){l\ and Y = w\j){k\ - w\k){j\ for j < k, then [X,Y] = {6ji - 6ki){{iw)\j){k\ - 
{iw)\k){j\) so that V^Y G T[/C. So whenever X is such that X = S{X,Y) = for ah 

Y G T[/C. Moreover, if X = v\l){m\ — v\m){l\ and Y = w\j){k\ — 'w\k){j\, then 

[X..Y]=6jrn{vw\l){k\-m\k){l\)+6krn{vw\j){l\-Vw\l){j\) 

+ dki{vw\m){j\ — vw\j){m\) + 6ji[vw\k){m\ — vw\m) {k\Y (53) 

Consequently, if U^X and U^Y both commute with U'^OU, then Vj^y from (52c) also commutes 
with WOU and therefore lies in TuC, so that S{X,Y) = (V^y)"^ = 0. Likewise, if WX and 
U^Y both commute with /), then Vjj^y commutes with p so that again S{X,Y) = (Vj^y)"*" = 0. 
So among these basis vectors for TijC, the only pairs X and Y for which ^(X, y) can be non-zero 
are those in which one of the pair {U"^ X,U^Y} commutes with U'^OU, but not with p, and the 
other commutes with p, but not with U'^OU . Then the basis vectors for T^/C can be divided 
into three categories: (1) those for which U'^ X commutes with U^OU but not p, of which there 
are '^mf — ^ kf^; (2) those for which U'^X commutes with p but not U^OU, of which there are 
^ — ^ kfj-, and (3) the remaining ^ kfj basis vectors for which U^X commutes with both U^OU 
and p. 

Taking any normal vector Z G (T^/C)"*", the shape operator (or Weingarten map) Az ■ 
TfjC — )• T[/C, defined by {AzX,Y) = {S{X,Y), Z) , can be represented in block form using these 
three categories of tangent vectors as 

"0 BO 

Az= (54) 



where B is an (^ m? — ^ kfj) x (^ "-j ~ S ^fj) matrix. The eigenvalues of Az are the principal 
curvatures of the submanifold C at the point U with respect to the normal vector Z. Because of the 
indicated block structure, the nonzero eigenvalues of Az come in positive- negative pairs ±77, where 
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1] is a nonzero eigenvalue of B B. Moreover, these eigenvalues can be bounded by observing that 
the squared Hilbert-Schmidt norm = J2j<m '^T' (^gmij + kp^um) where Xm = Ag„, Xj = Xp., 
(Ty(j) = , = CTu^ , and Ai > • • • > As and ai > ■ ■ ■ > dr are the distinct eigenvalues of p and 

O with multiplicities mi, . . . , rus and ni, . . . , n^. Most importantly, Tr{Az) = for every normal 
vector Z, so that the mean curvature vector field [IG] H = S{Xi,Xi) for any orthonormal basis 
{Xi} of T[/C, is zero. Therefore every critical submanifold of J{U) = Tr{U pU^ O) is a minimal 
submanifold of U(iV) [ 'n]. 



C.l. Examples. We now return to the examples considered in Sections 3.1 and 4.3 and compute 
the shape operators of the critical submanifolds. 

Example 7 (Maximum Submanifold of Pi^j). From the contingency table for Pi~^f = 1 (see 
Section 3. 1 ), it is clear that ^ m? — ^ kf^ = S ~ S ^fj — ^' ^^^^ matrix B in the shape 
operator Az zs x 0. Therefore Az is always the zero operator for all normal vectors Z, meaning 
that this critical submanifold is totally geodesic [:iOj. 

Example 8 (Minimum Submanifold of Pi^f). Consider the case where o"y(2) = 1 o,nd cry^^j-^ = 
for all i 7^ 2. The first category basis vectors are then Xj = V'^V'J'XjW^O,, where Xj = Wj\l){j\ — 
Wj\j){l\ for j = 3, . . . , N andforwj = ^ andwj = Likewise, the second category basis vectors 
are Yj = rWJ'YjW^Q forYj = Vj\2){j\-Vj\j){2\ forj = 3,..., N, and for vj = ^ andvj = So 
Vj^ Yk = 5jfe^r"l'yj'[Xj, Y^']VF"''r2, where [Xj,Yj] = —WjVj\l){2\ + iIjjVj\2){l\. Then for any normal 
vector Z = VWyZW'^VL with Z = z\l){2\ -z|2)(l|, {V^Y,Z) = \ {[X,Y],Z) = -^{wjVjz), so the 
(2N — 4) X (2A^ — 4) matrix B in the shape operator Az is 



B = —-In-2 



(55) 



Then B^B = ■^Il2Af-4; so that Az has eigenvalue = -y- with multiplicity 2N — 4, r/ = — ^ with 
multiplicity 2N — 4, and rj = with multiplicity N'^ — AN + 6. When Z is normalized with respect 
to the Hilbert-Schmidt norm, l-zp = 1/2, so the nonzero eigenvalues of Az are rj = i^;^- 

Example 9 (Fully Non-degenerate p and O). In the case where p and O are both fully non- 
degenerate, mi = ■■■ = m]\! = 1, m = ■ ■ ■ = n]\f = 1 and K = CP^. Then for any critical 
submanifold Orb(J'), Yl "^i ~ Yl ^ij —Yl^'j~Yl ^ij — ^'^ ^^^^ matrix B in the shape operator 
Az is 0x0. Therefore Az is always the zero operator for all normal vectors Z, so that every critical 
submanifold of J is totally geodesic when p and O are fully non- degenerate. 
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Appendix D. In Support of Conjecture 1 

In this appendix we build the case for Conjecture 1. Let U = T^VTW^Q be a critical point, 
where (V^, H^) € U(m) © U(n) and r2,r G U(A^) diagonalize p and O, respectively, i.e. QpQ^ = A 
and rort = S. Also let A = Y.i=i mA, where ^ 1, i; = zi\ji){ki\ - zi\ki){ji\, jz^p = 1/2, 

and {ji, A;2, . . . , jL, /cl} is a set of 2L distinct indices in {1, ... , N} such that the Hessian eigenvalue 
Pj^ki = —{^ji — ^feJl'^J'O;) ~ ^y(fci)) nonzero. As shown in Appendix B, for the Hessian at a 
critical point, every eigenspace corresponding to a nonzero eigenvalue is spanned by eigenvectors of 
the form z\j){k\ — z\k){j\ with j < k. Then exp(sA)pexp(— sA) = fi^VF exp(s^) A exp(— 5^)11^^17. 
Now, defining = zi\ji){ki\ + zi\ki){ji\ and = ^ aiA^ ^ it is easily verified that 

L 

[i,A] = ^(Afc, -A,Jazi+ (56a) 
1=1 

[A,Ai]=ai{\ji){ji\-\ki){ki\) (56b) 
[A,\ji){ji\-\ki){ki\] = -2aiAi, (56c) 

so that [i, = -2afAi, and [i, [i, Ui)(ii| " l^«>(^d]] = -2a?(|ji)(jH " Then, using 

the notation adx for the adjoint operator a.dxiY) = [X,Y\ [;9], 

~ s'"ad'J(A) 

exp(s^)Aexp(-sA) = exp(sad^)A = 2^ y (57a) 

m=0 

^ adt' (A) ^ ad^"+i (A) 

m=l ^ m=0 ^ ' 

= A + 5: a?(A., - A,) — ^ f "^^^'^ 

Z=l m=l ^ 

L oo^ g2m+l ad^™(i+) 

«=1 m=0 ^ ^ 

^ °^ ( e'\2m/'_9\m— 1 

= A + j;(A,, - A,J ^ ^^\y^(U^)0-^l - 

/=! m=l ^ '' 

i=l m=0 ^ ^ 

= A + ^ ^^^^(1 - cos(^/2a^5)) (|j,)(jH - 

+ E sin(^/2a^.)i+ (57e) 
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SO that 



1 e pe 



2 



V2 



sin(\/2a;s) [y'^ST, Af] ) VF+j^ 



5] sm(V2a,.)A, 



(58a) 

(58b) 
(58c) 
(58d) 



where Pjk is the Hessian eigenvalue corresponding to the eigenvector U(yz\j){k\ — z\k){j\). It 
follows that, for this choice of A, f{s) = /3|^^^ sin^(\/2Q;/s)/2 for all s. Now, let q{s) = 
sin2(V2a/s) - afs\\?{V2s). Then q'{s) = \/2ai sm(2^/2ais) - ^/2af sm{2^/2s) , and q"{s) = 
Aa.1 cos {2^f2ais) - cos(2^/2s)] . Since g'(0) = and q"{s) > for all s G [0, 7r/(2^/2)], q'{s) > for 
all s E [0,7r/(2^/2)]. And since q{0) = 0, this implies that q{s) > for ah s G [0, 7r/(2\/2)]. There- 
fore, /(s) > X;//5|,fc,"z^sin2(\/2s)/2 > ^'^■^sm'^{V2s)/2, since E"? = 1- So for ah unit normal 
vectors UA of this form, f{s) satisfies the conjecture. Moreover, when A = z\j){k\ —z\k){j\ for some 
j 7^ k, f{s) = Pjf^ sin^(\/2s)/2, and in the particular case when j and k are such that U A is an eigen- 
vector corresponding to the smallest eigenvalue (in absolute value), then f{s) = sin^(-v/2s)/2 
for all s, so this conjectured lower bound is attained. 

Now, for that same critical point U = rW'PW''n, let = AA(Hess)-^ C T[/U(7V) be the orthogonal 
complement of the null space of the Hessian at U, i.e. the span of the Hessian eigenvectors corre- 
sponding to non-zero eigenvalues. Then let S C U^Z C u(A^) be the unit sphere within Z. For 
any s G [0, 7r/(2V2)], let G : § ^ M be given by Cs{A) = \\ grad J(C/exp(sA)) |p. Then, using the 
integral expression for the derivative of the matrix exponential [21, App. B], the differential of 
is found to be 



dACsiSA) = 2 ( e-''^U^OUe'^,p 
-1 



1 





^srA 



2s{ I e 
2s ( 



e-^"^dr, 6 A 



(59a) 
(59b) 



ytsy, ytsy, e'^Ae"'^ , e'^Ae""^ e''^^-''^'^ dr, W^nSAQ^W^ , 

(59c) 
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where A = W^^IAQ^W . When A = aiAi with Ai = zi\ji){ki — zi\ki){ji\ as above, we have seen 
that [Ttsy,e"^Ae-"^] = J2i l^hk sm{V2ais)Ai/V2. Then 



so that 



^ ^ sin{V2ais)e'^ [A, A] e"^^ 
z=i 

J^a^Afe, -A,J^sin(^/2a;.)e^^i+e-^^ 



L 



-'31^1 



m=0 



J^aKA.. - A,J^sin(^/2a,.) J] — 

Z=l m=0 ^ ' 



+ ^a,(Afc,-A,J%sin(^/2a,.)^ 



oo „2m-|-l 



adf+i(^+) 



f2m + 1)! 



m=0 

' \2m( n\Tn 



(2m)! 



1=1 



-sil-r)A 



sA 



-sA 



-Y.aif^sm{2V2ais)Ai 
1=1 



(60a) 
(60b) 
(60c) 



(60d) 



+ Y^aiiX,,-X,,)^smiV2ais) J] ^ {\ji){ji\ " (60e 

1=1 ^ m=0 

E«KA.,-A,J^ 



-^sin(2\/2azs)i+ + sin2(\/2azs)(|jz)(jH - \ki){ki\)^ , (60f) 



(61a) 
(61b) 



because Ai commutes with A for ah / under the stated assumptions. Since the domain of is 
the unit sphere § C W Z C u(n), A is is critical point of Cs if and only if dACsi^A) is zero for all 
6A perpendicular to A, i.e. if and only if (61b) is proportional to ^ = Yl'^i^i- So for a given 
s there exist many critical points of Cs among these A matrices, in particular the cases in which 
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A = z\j){k\ — z\k\){j\ are critical for all s. If a more detailed critical point analysis could be 
performed for Q with s E [0, 7r/(2\/2)], it might lead to a confirmation of Conjecture 1. 

In addition to these analytic results that hint at the conjecture, more than one million numerical 
simulations have been performed to test it. In each simulation, random degeneracy structures 
were sampled for p and O, random eigenvalues chosen, a random critical point and a random 
normal vector selected. Then f[s) was computed over the interval [0, 7r/(2-v/2)] and compared to 
the function j3'^-^sm'^{V2s)/2. This was done for different system sizes including = 6, = 17, 
N = 25, and 1000 trials with = 256 (which take much longer to run). In every case Conjecture 
1 was satisfied. So seemingly, either the conjecture is true, or any counterexamples that exist are 
hard to find. 
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